Introduction

 

Protein methyltransferases (MTases) are the enzymes, which are meant for methylation of protein (Boriack-Sjodin and Swinger 2016). The SET domain MTases catalyze the reaction between a protein substrate and S-adenosyl-L-methionine (SAM), yielding a methylated protein and S-adenosyl- L-homocysteine (SAH). S-adenosyl-L-methionine (SAM) acts as an important cofactor for the transfer of the methyl group to biological molecules like DNA, RNA and proteins (Petrossian and Clarke 2011). SET domain MTases are a new family of methyltransferases, which specifically methylate lysine residues of a large number of different proteins (Yeates 2002). SET domain family was named after the Drosophila genes in which it was first discovered; Su (var), Enhancer of zeste, and Trithorax (Jenuwein et al. 1998). Later on, it was revealed that these genes encode histone lysine methyltransferases (Rea et al. 2000; Nishioka et al. 2002). SET domain proteins have now been found in all eukaryotic organisms. The domain, which is approximately 130 amino acids long, was characterized in 1998 (Dillon et al. 2005).

At the molecular level, RKM4 involves in the transcription cofactor activity, histone methyltransferase activity, zinc ion, and tetra-pyrrole binding, oxidoreductase activity and acts on other nitrogenous compounds as donors. On the other hand, RKM4 also plays an important role in many biological processes, which include; Organ development, negative regulation of cellular macromolecule biosynthetic process and RNA metabolic process, negative regulation of gene expression, role in histone methylation, generation of precursor metabolites and energy, establishment of localization and peptidyl-lysine mono-methylation (Yang and Zhang 2015; Zhang et al. 2017).

Methylation is an extremely important post-translational modification (PTM) of proteins. Methylation has been widely studied concerning the “histone code” gene expression regulation (Black et al. 2012). The recent studies show that the methylation of protein also has a potential role in the non-histone proteins (Erce et al. 2012; Low and Wilkins 2012; Clarke 2013). Dozens of methylation sites have been explored in Ascomycetes (Plank et al. 2015; Yagoub et al. 2015) and a hundred and thousand sites in human through the methylproteome enrichment studies (Bremang et al. 2013; Guo et al. 2014).

The current study is focused on the investigation of genetic variations in the RKM4 gene of different strains of S. fimicola, collected from “Evolution Canyon”, Israel. The “Evolution Canyon”, Israel has two contrasting slopes: south-facing slope (SFS) and the north-facing slope (NFS). It presents a microscale environment for the study of genetic variations in different organisms due to its diverse environmental conditions (Nevo 2012). Genetic variations are caused by recombination, spontaneous mutations, gene conversion, and environmental stress. These are key driving factors for evolution and species adaptation (Hoffmann and Hercus 2000; Saleem et al. 2001). The strains of S. fimicola from the south-facing slope (SFS) bear a high frequency of mutations and gene conversion than the strains from the north-facing slope (NFS), (Arif et al. 2019; Jamil et al. 2019). Therefore, the SFS strains undergo more genetic variations than the NFS strains of S. fimicola (Saleem et al. 2001). The fact is that SFS has xeric, harsh conditions and NFS has mild environmental conditions (Nevo 2012). So environmental stress is a major cause of genetic variations (Saleem et al. 2001). Genetic variations happening in the DNA ultimately pass into the proteins and then these effect post-translational modifications of proteins. The differences in the position of modified sites in the same protein among different strains of S. fimicola and in the S. cerevisiae are the reflections of genetic variations (Arif et al. 2017a, b). Another purpose of this study is to predict the possible post-translational modifications, 3D structures, and functions of RKM4 protein. Very little work has been done on post-translational modifications of the RKM4. To bridge this knowledge gap, we have used different bioinformatics tools to investigate post-translational modifications in this study. Although the bioinformatics tools are reliable, there is a need to study the post-translational modifications of the RKM4 protein experimentally to authenticate this study.

 

Materials and Methods

 

Sub-culturing of experimental organism

 

The Molecular Genetics Laboratory of Department of Botany, University of the Punjab, Lahore, provided the stock cultures of parental strains of S. fimicola. Originally, these strains were isolated from “Evolution Canyon”, Israel by Prof. Nevo’s Colleagues. The S1, S2, S3 strains collected from the south-facing slope (SFS) and the N5, N6, N7 strains were collected from the north-facing slope (NFS). The sub-culturing of these strains was carried out on PDA (potato dextrose agar) media under sterile conditions. The mature fungal growth was obtained after 9 days by incubating the samples at 20℃ in the refrigerated incubator.

 

Extraction of genomic DNA

 

Genomic DNA from all parental strains of S. fimicola was extracted by the modified Spano et al. (1995) method (without a phenolic wash) followed by resolving DNA fragments by 1% agarose gel electrophoresis stained with 0.3µl ethidium bromide. 1 kb DNA ladder was used and the gel was photographed under UV light in the gel documentation system. Afterward, primers specific to the RKM4 region were designed using the Primer-BLAST tool from the NCBI server (https://blast.ncbi.nlm.nih.gov/Blast.cgi) to amplify the RKM4 gene in all strains of S. fimicola.

 

Touchdown PCR conditions for amplification of RKM4 gene

 

Touchdown PCR (TD-PCR) conditions were used for the amplification of genes to study the possible genetic variations and potential sites for post-translational modifications of different strains of S. fimicola. 15 μL PCR reaction mixture was composed of 2 μL DNA sample, 1 μL forward primer, 1 μL reverse primer, 10 μL 2X Amp Master Mix, and 1 μL ddH2O. PCR took109 minutes and 40 cycles for the complete amplification of the gene. The time required for each step of the PCR and other conditions are given in Table 1. The amplified product was resolved at 1% agarose gel electrophoresis followed by visualization under UV light in the gel documentation system and the PCR product was sent to Macrogen Korea for sequencing. Afterward, the sequences translated into protein sequences by EMBOSS Transeq online server (https://www.ebi.ac.uk/Tools/st/emboss_transeq/Protein).

 

3D structures prediction and visualization

 

I-TASSER was used to predict the 3D structures and functions of RKM4 proteins. The confidence of each model is quantitatively measured by the C-score value that is calculated on the base of the significance of threading template alignments and the convergence parameters of the structure assembly simulations. The protein structures were visualized in PyMol molecular system. The ligand-protein interaction predictions were carried out by using BioLip, which is a ligand-protein binding database.

 

Tools used for prediction of post-translational modifications

 

Different online bioinformatics tools were used for the prediction of post-translational modifications. The PMes Server (bioinfo.ncu.edu.cn/inquiries_PMeS.aspx) was used for the prediction of methylation at lysine and arginine residues and NetPhos 3.1 Server (http://www.cbs.dtu.dk/services/NetPhos/) for the prediction of phosphorylation at threonine (T), tyrosine (Y) and serine (S) residues. The PAIL (http://bdmpail.biocuckoo.org/prediction.php) and the NetNES Servers (http://www.cbs.dtu.dk/services/NetNES/) were used for the prediction of acetylation at arginine (R) residues and nuclear export signals, respectively.

Results

 

Multiple sequence alignment

 

The extracted DNA from different strains (S1, S2, S3, N5, N6, and N7) of Sordaria fimicola were subjected to amplification of the RKM4 gene using touchdown PCR conditions. RKM4 regions with 900 base pairs length were amplified in all studied strains of S. fimicola. After sequencing, the sequences of the RKM4 gene of different strains of S. fimicola were aligned with S. cerevisiae (reference strain) by online clustal omega alignment tool to observe genetic variations among different strains of S. fimicola.

We obtained 12 different polymorphic sites in the RKM4 regions of six strains of S. fimicola with respect to the S. cerevisiae. Out of 12 polymorphic sites, six non-synonymous substitutions were observed in the RKM4 region. Non-synonymous substitutions are those substitutions, which change the coding amino acid. At first polymorphic site in the SFS strains, T was substituted with A at the second base of a codon, resulting in the change of ATC codon into AAC, which changed the Isoleucine (I) into asparagine (N). At a second polymorphic site in the S2, S3 and N5 strains, A was substituted with G, resulting in the change of codon from GAG to GGG, which changed the encoding amino acid from glutamate (E) to glycine (G). At the third site in SFS strains, AT was substituted with CG, changed the codon from GAT to GCG and changed the encoded amino acid from aspartate (D) to alanine (A). In fourth polymorphic site in NFS strains, T was replaced with A at first base of the codon, where TTT is converted into ATT and changed the amino acid from phenylalanine (F) to isoleucine (I). In SFS strains at fifth polymorphic site, G was substituted with A at third base of the codon (ATG-ATA), resulted in the change of methionine (M) into isoleucine (I). At tenth site in the S3 strain, the substitution of T with A at second base of codon was occurred (TTT-TAT), which substituted the tyrosine (Y) with phenylalanine (F). Other polymorphic sites did not change the coding amino acids, hence known as synonymous substitutions (Fig. 1–2).

 

Analysis of 3D structures and ligand-protein interactions

 

Table 1: Touch Down PCR conditions

 

Stage 1

Step

Temperature (°C)

Time

1

Denaturation

95

3 min

2

3

Denaturation

Annealing

95

Tm + 10

30 s

45 s

4

Elongation

72

60 s

Repeat steps 2-4 for 15 times

Stage 2

Step

Temperature (°C)

Time

5

Denaturation

95

30 s

6

Annealing

Tm or (Tm – 5)

45 s

7

Elongation

72

60 s

Repeat 5-7 steps for 25 times

Termination

Step

Temperature (°C)

Time

8

Elongation

72

5 min

9

Stop reaction

4

15 min

10

Hold

23

Until removed from machine

 

The 3D cartoon models of RKM4 protein for S. cerevisiae and S. fimicola are shown in Fig. 3. The motifs shown in red color are α-helix, motifs in yellow color are β-sheets and motifs in green lines are expressing coils. Both 3D protein structures are different at loop regions and have a difference in coiling. The ligand-protein interaction is shown at 3D models of protein with ligand binding site residues for S. cerevisiae and S. fimicola in Fig. 4–5, respectively. S. cerevisiae has three ligands; SAM, Zn+2 and (R, R)-Butane-2, 3-diol, and each of the ligands has its binding site residues. SAM has binding site residues; E80, G81, L82, S221, R222, D239, L240, I241, N242, H243, Y287, Y300, and F302; while the Y287 provides a catalytic binding site. The binding site residues of Zinc are; C65, C68, H86, and C90. (R, R)-Butane-2, 3-diol has four binding site residues; Y41, Y54, C55 and T220 (Fig. 4a–f).

S. fimicola has two SAM and lysine ligands. SAM binding site residues include; V72, A73, G74, Y75, A222, Y223, D248, I249, L250, N251, H252, Y285, Y297, and F299; while lysine has six binding site residues; A222, S224, F225, Q226, Y285, and Y297 (Fig. 5a–d).

 

Prediction of post-translational modifications

 

Prediction of phosphorylation: For the RKM4 protein of S. cerevisiae, phosphorylation was predicted at 26 serine (S), 18 threonine (T) and 7 tyrosine (Y) residues at different sites in the amino acid sequence. Phosphorylation was observed at 14 serine (S), 8 threonine (T) and 6 tyrosine (Y) residues of S1 and S2 strains. Phosphorylation for S3 and N5 strains was found at 13 serine (S), 7 threonine (T) and 8 tyrosine (Y) residues. For

 

Fig. 1: Multiple sequence alignment of different strains of S. fimicola with respect to the S. cerevisiae to observe genetic diversity among different strains of S. fimicola for RKM4 gene

Keywords: Symbol (*) showing fully conserved sites, space and highlighted regions showing polymorphic sites

 

N6 strain, it was predicted at 13 serine (S), 6 threonine (T) and 7 tyrosine (T) residues. 13 serine (S), 6 threonine (T) and 8 tyrosine (Y) residues of N7 strain were phosphorylated (Table 2).

Prediction of methylation: In this study, it is reported that only arginine residues of RKM4 have undergone methylation. Methylation at six arginine residues (R98, R213, R243, R388, R390, and R445) was investigated in S. cerevisiae. Only one arginine residue R62 was found to have the potential for methylation in all studied strains of S. fimicola (Table 3).

Prediction of acetylation: In S. cerevisiae, acetylation was investigated at 18 lysine (K) residues. In the S1, S2 and S3 strains of S. fimicola, acetylation was found at five lysine residues and seven sites in the N5, N6 and N7 strains (Table 3).

Prediction of nuclear export signals (NES): We have reported two nuclear export signals at positions 359L, 140L and three nuclear export signals at positions 60L, 276I, 279I of RKM4 in S. cerevisiae and S1, S2 and S3 strains, respectively. In N5, N6 and N7 strains, four NES sites (53L, 60L, 275I, and 278I) were observed. 60 L residue has been  found common for all strains of S. fimicola and other sites are present in close proximity with respect to one another. This shows that the 60 L site is conserved in all strains of S. fimicola (Table 3).

 

Discussion

 

To the best of our knowledge, the RKM4 gene is first time reported in S. fimicola. In the current study, genetic variations investigated in the RKM4 gene of S. fimicola. The SFS strains have nine polymorphic sites and the NFS strains have three polymorphic sites (Fig. 1). Non-synonymous substitutions were observed at six sites in the RKM4 region, resulted in the change of coding amino acid (Fig. 2). As SFS have xeric and more stressful conditions, due to this SFS strains have more genetic variations than NFS strains. This reveals that environmental stress has a role in the creation of genetic variations as reported by other geneticists. Arif et al. (2017a) investigated polymorphism in the S. fimicola with the help of SSR marker and identified that SFS strains have more variations than NFS strains. Hosid et al. (2008) reported high levels of polymorphism with the help of SSR marker in the soil fungus Emericella nidulans from a stressful environment and low levels of polymorphism in the fungus collected from an arid environment. Moreover, genetic variations help the species to better survive in the fluctuating and stressful environment and are major causes of evolution (Hoffmann and Hercus 2000; Saleem et al. 2001). Environmental conditions encounter organisms with natural selection by manipulating parental and genetic variants and thus genetic variations become a requirement for evolution as they determine the evolutionary potential of a population (Arber 2000).

 

 

Fig. 2: Multiple sequence alignment of amino acid sequence of RKM4 protein of different strains of S. fimicola with respect to the reference strain S. cerevisiae amino acid sequence to spot genetic diversity

Symbol (*) showing conserved sites, space and highlighted regions showing polymorphic sites

 

On the other hand, genetic variation also has its reflection on post-translational modifications of proteins as described by Arif et al. (2017b) in Frequency Clock and Mating Type a-1 proteins of parental strains of S. fimicola from EC Israel. Therefore, the difference in the positions of modified sites for the same protein in the different strains of S. fimicola is the reflection of genetic variations (Table 2 and 3). Genetic variations of nucleotide sequence ultimately translated into the protein and this produces the proteins with unique PTM sites, which lead towards the functional diversity of proteins. Thus, post-translational modifications (PTMs) are very important because they change the configuration of proteins and this affects their catalytic functions. Therefore, it is necessary to study them to see how the PTMs play their role in maintaining the biological functions of proteins (Lothrop et al. 2013). There are many types of PTMs of proteins, while the present study mainly focused upon phosphorylation, methylation, and acetylation of RKM4 methyltransferase of S. cerevisiae and S. fimicola.

Methyltransferases are involved in important biological processes and methylate specific lysine (Lys) and N-terminal residues of different subunits of the translational machinery (Porras-Yakushi et al. 2007; Lipson et al. 2010; Hamey et al. 2016). SAM-dependent MTase, RKM4 mono-methylates 60S ribosomal protein L42 (RPL42A and RPL42B) at 'Lys-55' (Webb et al. 2008; Lipson et al. 2010). A second SET domain methyltransferase Rkm2 is also identified, which is responsible for tri-methylating the ribosomal protein L12ab (Rpl12ab) at lysine 10. The second site of lysine methylation for Rpl12ab at position 3 by RKM2 is identified (Porras-Yakushi et al. 2006; Webb et al. 2008; Gardner et al. 2011).

The RKM4 has many important molecular and biological functions, which were predicted by I-TASSER in this study (Yang and Zhang 2015; Zhang et al. 2017). Uslupehlivan et al. (2018) predicted the 3D structure and functions of the prion protein of sheep (Ovis aries) by I-TASSER. Likewise, Rong et al. (2019) used I-TASSER to predict the membrane-spanning helices and topology models for Δ17 fatty acid desaturases from Rhizophagus irregularis and Octopus bimaculoides.

Methylation predominantly found on lysine (Lys) and arginine (Arg) residues in eukaryotes (Clarke, 2013). In the present study, methylation was predicted at arginine residues of RKM4 MTase of S. cerevisiae and S. fimicola. The six-arginine residues of RKM4 MTase of S. cerevisiae have the potential for methylation. Only one site (R62) was predicted for RKM4 protein, which is common for the SFS and the NFS strains of S. fimicola (Table 3). Winter et al. (2017) observed methylation at 528th lysine residue of the RKM4 protein of S. cerevisiae using mass spectrometry.

Protein phosphorylation plays an important role in the regulatory and signaling processes of the cell. This affects up to 30% of the proteome and essential in the regulation of cellular functions, protein degradation, and stabilization. In addition, phosphorylation networks are also essential backbones of the communication system within cells (Manning et al. 2002; Ficarro et al. 2002). In the present study, a total 51 phosphorylated sites were investigated for the RKM4 protein of S. cerevisiae (Table 2). Winter et al. (2017) have identified 3 serine phosphorylation sites at S24, S420, and S446 positions and one threonine phosphorylation site at 480 amino acid position of RKM4 MTase of S. cerevisiae. Serine modification at S420 and threonine modification at T480 was also predicted in the present study beside other sites in S. cerevisiae (Table 2).

 

Fig. 3: 3D structure models of RKM4 protein of (a) S. cerevisiae (b) S. fimicola generated by I-TASSER and visualized by PyMol. Motifs shown in red color indicate α-helix, yellow indicate β-sheet and motifs shown in green represent coil structure

 

 

Fig. 4: Ligand-protein interaction of RKM4 protein of S. cerevisiae. RKM4 protein has three ligands; SAM and lysine generated from BioLip (ligand-protein binding database). (a). SAM ligand attachment with protein. (b). SAM ligand with its binding site residue, while Y287 is catalytic site residue (c). Zinc ligand attachment with protein. (d). Lysine ligand with its binding site residues. (e). (R-R)-Butan-2,3-diol attachment with protein. (f). (R-R)-Butan-2,3-diol with its binding site residues shown in violet color

In S1, S2, and S3 strains of S. fimicola, phosphorylation reported at 28 sites. For N6 and N7 strains of S. fimicola, 26 and 27 sites were predicted, respectively (Table 2). Zhu et al. (2001) identified 4,200 phosphorylation events affecting 1,325 proteins from the 87 yeast protein kinase assays by the use of proteome chip technology. Bodenmiller et al. (2010) studied two serine phosphorylation sites at 67 and 69 positions. Winter et al. (2017) studied phosphorylation at two other serine residues at 129 and 573 positions. Albuquerque et al. (2008) also reported phosphorylation at one serine residue at 573 amino acid position experimentally for RKM1 of S. cerevisiae.

 

Table 2: Phosphorylation predicted sites with their protein kinases for RKM4 protein of S. cerevisiae and different strains of S. fimicola. Numbers in third column are showing the phosphorylation positions on serine, threonine and tyrosine residues of RKM4. The numbers in the others columns (last four) are showing the positions, where the specific protein kinase involved in the phosphorylation of its respective residue i.e. serine, threonine, and tyrosine

 

Organism

Residues

Phosphorylation sites

Protein kinases

CKII

Unsp

PKC

PKA

 

 

 

 

 

S. cerevisiae

Serine

5, 24, 59, 60, 63, 67, 75, 86, 118, 137, 158, 172, 187, 189, 197, 198, 208, 270, 337, 420, 425, 429, 446, 450, 485, 486

Total = 26 sites

5, 189, 197, 198, 207, 208, 337, 450

24, 63, 75, 86, 158, 172, 198, 207, 208, 420, 425, 429, 446, 485, 486

63, 75, 158, 187, 446, 486

59, 60, 137, 429

Threonine

17, 44, 52, 65, 84, 171, 207, 232, 240, 303, 306, 308, 396, 401, 431, 457, 472, 480

Total = 18 sites

17, 84, 171, 308, 431

171, 232, 396, 457, 480

44, 52, 240, 401, 472

-

Tyrosine

217, 241, 261, 278, 286, 338, 341

Total = 7 sites

-

217, 261, 286, 341

241, 278

-

 

 

 

 

S1, S2

Serine

6, 38, 57, 78, 87, 92, 107, 109, 117, 118, 128, 190, 204, 257

Total = 14 sites

109, 117, 118, 128, 190, 257

6, 92, 118, 128, 204

78, 107

38, 57

Threonine

4, 91, 127, 152, 160, 223, 226, 228

Total = 8 sites

4, 127, 226, 228

91, 152

160

-

Tyrosine

137, 161, 181, 199, 206, 261

Total = 6 sites

-

137, 161, 181, 198, 206, 261

-

-

 

 

 

 

S3, N5

Serine

6, 38, 57, 78, 92, 107, 109, 117, 118, 128, 190, 204, 256

Total = 13 sites

109, 117, 256

6, 92, 118, 128, 204

78, 107

38, 57

Threonine

4, 91, 127, 152, 160, 225, 227

Total = 7 sites

4, 227

91, 127, 152

160

-

Tyrosine

100, 137, 161, 181, 196, 206, 257, 260

Total = 8 sites

225

137, 161, 181, 198, 206, 260

-

-

 

 

 

 

N6

Serine

6, 38, 57, 78, 92, 107, 109, 117, 118, 128, 190, 204, 256

Total = 13 sites

109, 117, 118, 128 190, 256

6, 92, 118, 128, 204, 78

78, 107

38, 57

Threonine

4, 91, 127, 152, 160, 225

Total= 6 sites

4, 91, 127, 225

91, 127, 152

160

 

Tyrosine

100, 137, 181, 196, 206, 257, 260

Total = 7 sites

-

137, 161, 181, 198, 206, 260

-

-

 

 

 

 

N7

 

Serine

6, 38, 57, 78, 92, 107, 109, 117, 118, 128, 190, 204, 256

Total = 13 sites

109, 117, 118, 128, 190, 256

6, 78, 92, 118, 128, 204

78, 107

38, 57

Threonine

4, 91, 127, 152, 160, 225

Total = 6 sites

4, 91, 127, 225

91, 127, 152

160

-

Tyrosine

100,137, 161, 181, 198, 206, 257, 260

Total = 8 sites

-

137, 161, 181, 198, 206, 260

-

-

 

Table 3: Table showing predicted methylation, acetylation and nuclear export signals (NES) sites for RKM4 protein of S. cerevisiae and all studied strains of S. fimicola. Numbers are showing methylation positions on arginine (R), acetylation positions on lysine (K) and NES positions on leucine (L) and isoleucine (I)

 

Organism

Residue

Methylation sites

Residue

Acetylation sites

NES sites

NES potential

S. cerevisiae

Arginine (R)

98, 213, 243, 388, 390, 445

Total = 6 sites

Lysine (K)

30, 46, 49, 77, 146, 149, 219, 234, 247, 368, 403, 423, 426, 445, 484,488, 493, 494

Total = 18 sites

359-L

0.575

140-L

0.529

S1, S2, S3

Arginine (R)

62

Total = 1 site

Lysine (K)

66, 69, 139, 154, 167

Total = 5 sites

60-L

0.624

276-I

0.507

279-I

0.562

N5, N6, N7

Arginine (R)

62

Total = 1 site

Lysine (K)

66, 69, 139, 154, 167, 220, 287

Total = 7 sites

53-L

0.514

60-L

0.646

275-I

0.505

278-I

0.561

Symbols L=Leucine and I=Isoleucine

Protein kinases are the most crucial enzymes, involve in protein phosphorylation. They transfer a phosphate group from ATP to the protein substrate and phosphorylate the protein. They constitute about 2% of eukaryotes genome and phosphorylate about 30% cellular proteins (Ubersax and Ferrell 2007). CKII, Unsp, PKC, and PKA are some important protein kinases, which are potentially involved in phosphorylation at different residues of RKM4 protein of S. fimicola and S. cerevisiae (Table 2). Arif et al. (2019) reported four protein kinases (PKC, Unsp, PKA, cdc2),  which are involved in the phosphorylation of Cytochrome c oxidase (COX1).

 

 

Fig. 5: Ligand-protein interaction of RKM4 protein of S. fimicola. RKM4 protein has two ligands; SAM and lysine generated from BioLip (ligand-protein binding database). (a). SAM ligand attachment with protein. (b). SAM ligand with its binding site residues, while H252 is catalytic site residue (c). Lysine ligand attachment with protein. (d). Lysine ligand with its binding site residues

 

Lysine acetylation is a reversible post-translational modification of proteins and plays a key role in regulating gene expression (Choudhary et al. 2009). Protein lysine acetylation has emerged as a key post-translational modification in cellular regulation, particularly through the modification of histones and nuclear transcription regulators (Zhao et al. 2010). For the RKM4 protein of S. cerevisiae, acetylation predicted at 18 lysine (K) residues. In the S1, S2 and S3 strains of S. fimicola, acetylation observed at five lysine residues and in the N5, N6, and N7 strains, seven acetylation sites were reported (Table 3). The difference in the acetylation sites for RKM4 protein lies between the strains of two contrasting slopes of EC. There was no difference in between the strains of each slope. Winter et al. (2017) studied the two lysine sites at 403 and 488 positions in the RKM4 protein of S. cerevisiae and these sites are also identified in all strains of S. fimicola. This shows the conservation of both sites and these sites might be involved in the regulation of RKM4 protein.

Nuclear export signals (NESs) play an extremely important role in the regulation of the subcellular location of proteins. Other nuclear processes and transcription are affected by this regulation. These processes are very important in maintaining the viability of the cell. The most important properties of NESs are accessibility and flexibility allowing relevant proteins to interact with the signal (Cour et al. 2004). For the RKM4 protein of S. cerevisiae, two positions; 140 L and 359 L have been predicted. Three positions (60-L, 276-I, 79-I) of nuclear export signals for S1, S2, and S3, four positions (53-L, 60-L, 275-I, 278-I) for N5, N6 and N7 have been predicted. One position 60-L is common for all studied strains of S. fimicola (Table 3). The presence of these nuclear export signals in the RKM4 protein of S. cerevisiae as well as in different strains of S. fimicola indicates that this protein is primarily regulated through these nuclear export signals. Arif et al. (2017a) predicted nuclear export signals in the frequency clock protein of S. fimicola at 328th amino acid residue and in Neurospora crassa at 323rd amino acid residue. Furthermore, some recent studies have been carried out by Jamil et al. (2018) and Arif et al. (2019) on the post-translational modifications of H3/H4 histones and cytochrome c oxidase (COX1) of S. fimicola, respectively by using different bioinformatics tools, also used in this study. These studies and current study will help to bridge the knowledge gap related to the post-translational modifications of proteins in S. fimicola.

 

Conclusion

 

It is concluded that SFS strains have more genetic variations than NFS strains because SFS strains have stressful environmental conditions. These genetic variations also have their reflections upon post-translational modifications of proteins. Therefore, in this study different post-translational modified sites are reported for RKM4 methyltransferase of S. fimicola. RKM4 has an important role in molecular and biological processes, which are predicted by I-TASSER. However, experimental studies need to authenticate these functions and to clarify the unknown functions of phosphorylation, methylation, and acetylation.

 

References

 

Albuquerque CP, MB Smolka, SH Payne, V Bafna, J Eng, H Zhou (2008). A multidimensional chromatography technology for in-depth phosphoproteome analysis. Mol Cell Proteomics 7:1389‒1396

Arber W (2000). Genetic variation: molecular mechanisms and impact on microbial evolution. FEMS Microbiol Rev 24:1‒7

Arif R, SH Bukhari, M Ishfaq, MG Shahid, SF Lee, M Saleem (2019). Genetic variation and post-translational modifications of cytochrome c oxidase-1 (COX1) in different strains of Sordaria fimicola. Intl J Agric Biol 21:1055‒1062

Arif R, F Akram, T Jamil, H Mukhtar, SF Lee, M Saleem (2017a). Genetic variation and its reflection on posttranslational modifications in frequency clock and mating type a-1 proteins in Sordaria fimicola. Biol Med Res Intl 2017; Article 1268623

Arif R, SF Lee, M Saleem (2017b). Evaluating short sequence markers in S. fimicola by high resolution melt analysis. Intl J Agric Biol 19:248–254

Black JC, CV Rechem, JR Whetstine (2012). Histone lysine methylation dynamics: establishment, regulation, and biological impact. J Mol Cell 48:491‒507

Bodenmiller B, S Wanka, C Kraft, J Urban, D Campbell, PG Pedrioli, B Gerrits, P Picotti, H Lam, O Vitek, MY Brusniak (2010). Phosphoproteomic analysis reveals interconnected system-wide responses to perturbations of kinases and phosphatases in yeast. Sci Signal 3; Article 153

Boriack-Sjodin PA, KK Swinger (2016). Protein methyltransferases: A distinct, diverse, and dynamic family of enzymes. Biochemistry 55:1557‒1569

Bremang M, A Cuomo, AM Agresta, M Stugiewicz, V Spadotto, T Bonaldi (2013). Mass spectrometry-based identification and characterization of lysine and arginine methylation in the human proteome. Mol BioSyst 9:2231–2247

Choudhary C, C Kumar, F Gnad, ML Nielsen, M Rehman, TC Walther, JV Olsen, M Mann (2009). Lysine acetylation targets protein complexes and co-regulates major cellular functions. Science 325:834‒840

Clarke SG (2013). Protein methylation at the surface and buried deep: Thinking outside the histone box. Trends Biochem Sci 38:243–252

Cour TL, L Kiemer, A Molgaard, R Gupta, K Skriver, S Brunak (2004). Analysis and prediction of leucine-rich nuclear export signals. Protein Eng Desig Select 17:527‒536

Dillon SC, X Zhang, RC Trievel, X Cheng (2005). The SET-domain protein superfamily: Protein lysine methyltransferases. Genome Biol 6; Article 227

Erce MA, CN Pang, G Hart-Smith, MR Wilkins (2012). The methylproteome and the intracellular methylation network. Proteomics 12:564–586

Ficarro SB, ML McCleland, PT Stukenberg, DJ Burke, MM Ross, J Shabanowitz, DF Hunt, FM White (2002). Phosphoproteome analysis by mass spectrometry and its application to Saccharomyces cerevisiae. Nat Biotechnol 20:1-6

Gardner KE, L Zhou, MA Parra, X Chen, BD Strahl (2011). Identification of lysine 37 of histone H2B as a novel site of methylation. PLoS One 6; Article e0016244

Guo A, H Gu, J Zhou, D Mulhern, Y Wang, KA Lee, V Yang, M Aguiar, J Kornhauser, X Jia, J Ren (2014). Immunoaffinity enrichment and mass spectrometry analysis of protein methylation. Mol Cell Proteomics 13:372‒387

Hamey JJ, DL Winter, D Yagoub, CM Overall, G Hart-Smith, MR Wilkins (2016). Novel N-terminal and lysine methyltransferases that target translation elongation factor 1A in Yeast and human. Mol Cell Proteomics 15:164–176

Hoffmann AA, MJ Hercus (2000). Environmental stress as an evolutionary force. Biol Sci 50:217–226

Hosid E, I Grishkan, Z Frenkel, SP Wasser, E Nevo, A Korol (2008). Diversity of microsatellites in natural populations of Ascomycetous fungus, Emericella nidulans, in Israel on local and regional scales. Mycol Prog 7:99-109

Jamil T, S Ijaz, R Arif, F Akram, M Saleem (2019). Diversity of Retrotransposons (Ty3-gypsy, LINEs and Ty1-copia) in Sordaria fimicola. Pak J Zool 51:12001202

Jamil T, N Sami, R Arif, Q Rashid, M Saleem (2018). H3/H4 Histone genes variations and its effect on post translational modifications in various strains of Sordaria fimicola. Intl J Agric Biol 20:1021–1026

Jenuwein T, G Laible, R Dorn, G Reuter (1998). SET domains modulate chromatin domains in eu and heterochromatin. Cell Mol Life Sci 54:80–93

Lipson RS, KJ Webb, SG Clarke (2010). Two novel methyltransferases acting upon eukaryotic elongation factor 1A in Saccharomyces cerevisiae. Arch Biochem Biophys 500:137–143

Lothrop AP, MP Torres, SM Fuchs (2013). Deciphering posttranslational modification codes. FEBS Lett 587:1247–1257

Low JK, MR Wilkins (2012). Protein arginine methylation in Saccharomyces cerevisiae. FEBS J 279:4423–4443

Manning G, GD Plowman, T Hunter, S Sudarsanam (2002). Evolution of protein kinase signaling from yeast to man. Trends Biochem Sci 27:514–520

Nevo E (2012). “Evolution Canyon,” a potential microscale monitor of global warming across life. Proc Natl Acad Sci 109:2960‒2965

Nishioka K, S Chuikov, K Sarma, H Erdjument-Bromage, CD Allis, P Tempst, D Reinberg (2002). Set9, a novel histone H3 methyltransferase that facilitates transcription by precluding histone tail modifications required for heterochromatin formation. Genes Dev 16:479–489

Petrossian TC, SG Clarke (2011). Uncovering the human methyltransferasome. Mol Cell Proteomics 10; Article M110.000976

Plank M, R Fischer, V Geoghegan, PD Charles, R Konietzny, O Acuto, C Pears, CJ Schofield, BM Kessler (2015). Expanding the Yeast protein arginine methylome. Proteomics 15:3232‒3243

Porras-Yakushi TR, JP Whitelegge, S Clarke (2007). Yeast ribosomal/cytochrome c SET domain methyltransferase subfamily identification of Rpl23ab methylation sites and recognition motifs. J Biol Chem 282:12368‒12376

Porras-Yakushi TR, JP Whitelegge, S Clarke (2006). A novel SET domain methyltransferase in Yeast Rkm2-dependent trimethylation of ribosomal protein L12ab at lysine 10. J Biol Chem 281:35835‒35845

Rea S, F Eisenhaber, D O'carroll, BD Strahl, ZW Sun, M Schmid, S Opravil, K Mechtler, CP Ponting, CD Allis, T Jenuwein (2000). Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406:1‒7

Rong C, H Chen, X Tang, Z Gu, J Zhao, H Zhang, W Chen, YQ Chen (2019). Characterization and molecular docking of new Δ17 fatty acid desaturase genes from Rhizophagus irregularis and Octopus bimaculoides. RSC Adv 9:6871‒6880

Saleem M, BC Lamb, E Nevo (2001). Inherited differences in crossing over and gene conversion frequencies between wild strains of Sordaria fimicola from “Evolution Canyon”. Genetics 159:1573–1593

Spano P, TM Fulton, J Chunwongesm, SD Tanksley (1995). Extraction of high quality DNA for genome sequencing. Mol Biol Rep 13:207

Ubersax JA, JE Ferrell Jr (2007). Mechanisms of specificity in protein phosphorylation. Nat Rev Mol Cell Biol 8:530-541

Uslupehlivan M, R Deveci, C Ün (2018). In silico investigation of the prion protein glycosylation profiles in relation to scrapie disease resistance in domestic sheep (Ovis aries). Mol Cell Probes 42:1‒9

Webb K, JA Laganowsky, JP Whitelegge, SG Clarke (2008). Identification of two SET domain proteins required for methylation of lysine residues in Yeast ribosomal protein Rpl42ab. J Biol Chem 283:35561–35568

Winter DL, G Hart-Smith, MR Wilkins (2017). Characterization of protein methyltransferase Rkm1, Rkm4, Efm4, Efm7, Set5 and Hmt1 reveals extensive posttranslational modification. J Mol Biol 430:1102‒1118

Yagoub D, G Hart-Smith, J Moecking, MA Erce, MR Wilkins (2015). Yeast Proteins Gar1p, Nop1p, Npl3p, Nsr1p, and Rps2p are natively methylated and are substrates of the arginine methyltransferase Hmt1p. Proteomics 15:3209–3218

Yang J, Y Zhang (2015). I-TASSER server: new development for protein structure and function predictions. Nucl Acids Res 43: 174181

Yeates TO (2002). Structures of SET domain proteins: protein lysine methyltransferases make their mark. Cell 111:5–7

Zhang C, PL Freddolino, Y Zhang (2017). COFACTOR: improved protein function prediction by combining structure, sequence and protein–protein interaction information. Nucl Acids Res 45: 291‒299

Zhao S, W Xu, W Jiang, W Yu, Y Lin, T Zhang, J Yao, L Zhou, Y Zeng, H Li, Y Li (2010). Regulation of cellular metabolism by protein lysine acetylation. Science 327:10001004

Zhu H, M Bilgin, R Bangham, D Hall, A Casamayor, P Bertone, N Lan, R Jansen, S Bidlingmaier, T Houfek, T Mitchell (2001). Global analysis of protein activities using proteome chips. Science 293:2101‒2105